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site is often included in the vector just preceding 
.* ing t which the protein coding sequence is to be placed. 
T° n 3 uences frequently placed in vectors for their efficient 
deSeq include chemically synthesized optimal consensus 
S |1ie T7 gtO-L region, or those of other highly translated 
. \ jg^ese standard translation initiation systems are suit- 
adequate expression of most proteins, the sequence in 
{ ^ t0 b e optimized to gain the highest level of protein 
Translational limitations can also occur as a result 
J^ific nature of the protein coding region (e.g., unusual 
-5r n un i q ue codon usage, or special secondary structures), 
pioblems cannot be addressed at the expression vector level. 

restriction Sites for Cloning 

selection of the location of the restriction endonuclease 
site at which the protein coding sequence is to be intro- 
into the plasmid can allow the expression of the protein in 
st useful form. In many cases it is desired to produce the 
"ete intact natural protein, beginning with its N-terminal 
acid and ending with the native C-terminal amino acid. This 
tly is the case when the exact structure of a mature, natural, 
utically active protein is needed. To position the coding 
.2 correctly with respect to the ribosome binding site, a 
^restriction endonuclease site incorporating an ATG se- 
jsused to place the translation initiation codon appropriately 
Vector. Two commonly used sites that contain an ATG and 
ixist in any given DNA sequence are Nco I and Nde I. A 
"Unique restriction site is often used where the C-terminal 
the gene is joined to the vector to orient the protein coding 
iduring cloning. Such a construct would then enable transla- 
te complete cloned protein coding region. 

gh not all mature proteins have an N-terminal methionine, 
$e primary product formed in bacterial systems. The extra 
icid can be removed from the protein after synthesis by in 
pthods, or the desired protein can initially be made as part 
ger peptide chain with subsequent cleavage by a specific 
"to generate the N-terminus of the mature protein product. 
% of N-terminal protein carrier elements have been used 
protein fusion products. In the formation of a fusion 
' the vector carries the appropriate signals for translation 
pn/as well as a coding region of the N-terminal component 
^protein and a restriction endonuclease cleavage site where 
'fragment bearing the foreign gene can be placed in the 
reading frame. The properties of the N-terminal component 
protein fusion can be used to assist in purification of the 
;protein. For example, if this segment encodes an easily 
protein such as the maltose binding protein or glutathione 
"'erase, the combined fusion product can be readily isolated 
^affinity chromatography. The addition of tracts of basic, 
or metal-binding amino acids in the carrier protein can also 
J to aid purification. This protein fusion concept can be 
to allow for protein processing in vivo. For example, if 
^nninal segment encodes a localization signal, the protein 
^ directed to the periplasmic space of E. coli or, in some 
Wo the extracellular medium. This type of export can aid in 
""^cation and stabilization of the protein. A vector employing 
1 A as the carrier is one of this type, 
^orne applications the expression of the entire protein is not 
For raising antibodies or for the detection of an antigen 



by antibodies, only a recognizable epitope is necessary, so in these 
cases expression of only a short protein segment is sufficient. The 
cloned segment comprising the antigenic site can be attached to a 
protein carrier segment that will allow effective exposure of the 
foreign peptide. Placement of the foreign segment at the C-terminal 
end of p-galactosidase has been used effectively for this purpose. 
In some cases a suitable surface protein can be used as the carrier 
portion to ensure that the foreign protein segment is localized to 
the surface of a cell or bacteriophage particle, thus presenting the 
immunologically active antigen or antibody species in a way that 
allows detection, isolation, or use of the living cell or virus particle. 

3 HOST CELL FEATURES 
3.1 ESCHERICHIA COLI 

The use of the correct host can have significant impact on the final 
yield of the desired protein product. Factors generally useful to 
the host include the ability to grow rapidly and to a high density, 
the ability to be transformed in a manner that facilitates the introduc- 
tion of the DNA construct, and the possession of a low recombina- 
tion and mutagenic rate, to ensure that the plasmid is not frequently 
lost, damaged, or otherwise inactivated. Stability to degradation is 
a major problem in the production of certain proteins. Therefore, 
host cells with reduced protease levels have been used to enhance 
the in vivo stability of the foreign protein of interest. Commonly 
used mutations in E. coli that have reduced protein degradation 
rates because of the inactivation of proteases are Ion, rpo H, and 
dp. In some cases the protein stability can be addressed at the 
expression vector level by using a fusion protein construct. The 
presence of the longer carrier protein often will effectively stabilize 
a small foreign peptide segment. Translation of large proteins is 
also a limitation in many strains; however other £. coli strains may 
have an increased ability to translate the particular product. 

3.2 Other Hosts 

While E. coli continues to be the most widespread expression 
system, other host-vector systems have certain advantages. The 
strengths of the E. coli system are the variety of vectors and 
specifically altered hosts available and the well-studied methods 
for manipulating this organism. High levels of production can 
be attained. However protease problems, formation of inactive 
inclusion bodies containing the product, and the lack of a eukaryotic 
glycosylation system limit production of a number of proteins from 
mammalian sources. Other bacteria have received some attention 
owing to their ability to grow on particular compounds, their poten- 
tial for secretion of the protein, or their industrial potential. 

Yeasts are a suitable production system for a number of pro- 
cesses. Not only are they well studied like £. coli, and amenable 
to scale-up, but they are able to carry out some posttranslational 
modification of eukaryotic proteins. 

Fungi can produce high yields of commercial proteins and are 
reasonably capable of glycosylation and secretion of proteins into 
the medium. Methods have not been so completely developed, and 
the use of organisms of this class has not been widespread. 

Viruses that infect insect cells (baculoviruses) have gained atten- 
tion as a system for producing glycosylated proteins at reasonably 
high levels but at less expense than is incurred using mammalian 
cells. The vaccinia virus has been used to express foreign antigens 
in whole animals, demonstrating its potential for use as a vaccine 
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vector system. Although mammalian cells have been studied exten- 
sively and have the advantage of producing truly identical processed 
mammalian protein, the costly, difficult scale-up has limited com- 
mercial production with this system. 

4 PERSPECTIVES 

The ability to clone and express high levels of proteins in other organ- 
isms has led to great advances in the speed and detail with which 
biological systems can be analyzed. With the expansion of this tech- 
nology to other organisms and the construction of more complicated 
and sophisticated derivatives of currently studied systems, the im- 
pact of this area will increase. Recombinant protein production has 
now begun to see use in applications to industrial microorganisms, 
the formation and use of transgenic plants and animals, and the analy- 
sis of unknown proteins associated with genetic disorders. 

See also DNA Markers, Cloned; £. COLi Genome; 
Fungal Biotechnology; Gene Expression, Regula- 
tion of; Plasmids; Yeast Genetics. 



Extracellular Matrix Material lying adjacent to and 
cells. 

Gene Structure Organization of the gene, including the 
exons, and introns. 

Intron DNA of the gene that is removed from the 
RNA product. 

mRNA Splicing The removal of introns from the p- 
and joining of exons in mature RNA; thus introns are 
out, while exons are spliced together. 



The extracellular matrix is necessary for development and 
functioning of all cell types in an organism: it is made up of 
proteins and glycoproteins whose complex interactions * 
the matrix properties. The composition of the extracellular 
is controlled at the level of gene expression by provi- 1 * 
quantity of mRNA sufficient to produce adequate proteins 
alternative splicing of pre-mRNA to generate proteins, 
proper functional domains. 
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Key Words 

Exon DNA Of the gene that is represented in the mature RNA 
product. 



1 INTRODUCTION 

The extracellular matrix (ECM) of all tissues is a complex 
of secreted proteins that collectively play a critical role uv 
ing and maintaining tissue function. The ECM is located j 
around connective tissue cells and under epithelia. ECMr 
range from the multifunctional fibronectins and throm" ' 
to the large families of collagen types, proteoglycans, and 
among others. Fibrillar collagens types I, II, and III are the 
ECM proteins that confer the structural characteristics^ 
tissues such as bone, skin, blood vessels, and cartilage. Q£ 
proteins such as the fibronectins, laminins, and tenascin pla,. 
roles in cell-cell interactions, cell migration, andcytoskef 
nization. The structure and function of extracellular matrix- 
have been reviewed recently in two very informative 
Hay (1993) and Kreis and Vale (1993). While not incl" 
contribution discusses the molecular biology of the major 
of ECM components. 



1.1 Regulation of ECM Gene Expression 

It is becoming increasingly clear that the regulation of E 
expression, both transcriptionally (to regulate quantity) 
transcriptionally (using alternative splicing of mRNA to 
or remove functional domains), is of crucial importarr 
morphogenesis and cell differentiation, cell migration and^ 
tion, wound healing, and disease processes such as fib^ 
arthroses. ECM expression is regulated by a wide variety 
factors and cytokines, being generally stimulated by, for? 
transforming growth factor beta, and insulin-like growuj 
I and II, and inhibited by interleukin I and interferon 
Interestingly, as more information accumulates regarding 
tion of ECM expression, it is apparent that ECM molecules^ 
independently controlled in their expression and that reg4! 
dependent on cell type. Table 1 shows some examples of^ 
of ECM molecules by cytokines. This is a rapidly emer 
and more information is published every day. It is now I 
certain ECM molecules can effect the expression of the 



